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Abstract — Interleaved Reed-Solomon codes are applied in 
numerous data processing, data transmission, and data storage 
systems. They are generated by interleaving several codewords 
of ordinary Reed-Solomon codes. Usually, these codewords are 
decoded independently by classical algebraic decoding methods. 
However, by collaborative algebraic decoding approaches, such 
interleaved schemes allow the correction of error patterns beyond 
half the minimum distance, provided that the errors in the 
received signal occur in bursts. In this work, collaborative 
decoding of interleaved Reed-Solomon codes by multi-sequence 
shift-register synthesis is considered and analyzed. Based on 
the framework of interleaved Reed-Solomon codes, concatenated 
code designs are investigated, which are obtained by interleaving 
several Reed-Solomon codes, and concatenating them with an 
inner block code. 

Index Terms — Interleaved Reed-Solomon codes, homogeneous 
IRS codes, heterogeneous IRS codes, concatenated codes, collab- 
orative decoding, shift-register synthesis, multiple sequences 



I. Introduction 

In recent years, code designs with interleaved Reed- 
Solomon (IRS) codes were the topic of several scientific 
publications. They are investigated by different authors like 
Krachkovsky, Lee, and Garg [1], [2], [3], Bleichenbacher, 
Kiayias, and Yung [4], Brown, Minder and Shokrollahi [5], [6], 
Justesen, Thommesen, and H0holdt [7], as well as Parvaresh 
and Vardy [8]. Moreover, IRS codes are considered in [9], 
[10], [11], and other publications. Interleaved Reed-Solomon 
codes are mainly considered in applications where error bursts 
occur, since IRS codes are most effective if correlated errors 
affect all words of the interleaved scheme simultaneously. 
In [3], [7], and [10], it is suggested to consider IRS codes 
also as outer codes in concatenated code designs, since even 
for channel models inducing statistically independent random 
errors, the decoder for the inner code will usually create 
correlated burst errors at the input of the decoder for the outer 
codes. For decoding IRS codes, Bleichenbacher, Kiayias, and 
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Yung [4] propose an algorithm based on the Welch-Be rlekamp 
approach. Parvaresh, and Vardy [8] consider IRS decoding 
in the context of multivariate polynomial interpolation, which 
yields a quite powerful list decoding algorithm. 

In this paper, we consider IRS codes in a rather general 
way. More precisely we consider IRS codes consisting of I 
Reed-Solomon codes -4 (2) , . . . , A^ of length N and 
dimensions K^ x \ K^ 2 \ . . . ,K^ 1 '. We take a codeword from 
each code A^K £ = 1, . . . , I, and arrange them row- wise into 
a matrix like depicted in Fig. 1. We call the set of matrices 
obtainable in this way an IRS code. If A (1) = A^ = ■ ■ ■ = 
we call the code homogeneous IRS code, if the codes 
are different, we call the resulting IRS code heterogeneous. 

Most previous publications consider only homogeneous IRS 
codes. Heterogeneous constructions have first been considered 
in [3] (without calling them heterogeneous IRS codes), and 
some of their properties have been investigated in [12]. Het- 
erogeneous IRS codes may be interesting, e.g. when used as 
outer codes of generalized concatenated codes introduced and 
described by Blokh and Zyablov [13] and by Zinoviev [14]. 
Another interesting application of heterogeneous IRS codes is 
the decoding of a single Reed-Solomon code beyond half the 
minimum distance like described in [15], where the problem of 
decoding a single low-rate Reed-Solomon code is transformed 
into the problem of decoding a heterogeneous IRS code. 

We propose a method for collaborative decoding of both 
homogeneous and heterogeneous IRS codes, which is based 
on multi-sequence shift-register synthesis. To analyze the 
behavior of this decoder, we derive bounds on the decoding 
error and decoding failure probability. These bounds allow for 
estimating of the gain, which can be achieved by collaborative 
decoding in comparison to decoding the I Reed-Solomon 
codes independently up to half the minimum distance by a 
standard Bounded Minimum Distance (BMD) decoder. 

If one column of the arrangement shown by Fig. 1 is cor- 
rupted by a burst error, i.e., an error which affects a complete 
column of the arrangement, the I Reed-Solomon codewords 
may have an erroneous symbol at the same position. Hence, a 
collaborative decoding strategy can be applied, which locates 
the errors jointly in all Reed-Solomon codewords instead of 
locating them independently in the several words. This allows 
for uniquely locating up to t errors, in many cases even if t is 
larger than half the minimum distance of the Reed-Solomon 
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Fig. 1. Interleaved Reed-Solomon code. 



code with the largest dimension. 

Algebraic decoding of a single Reed-Solomon codeword 
can efficiently be performed by the Berlekamp-Massey al- 
gorithm [16], [17], which is based on a single-sequence 
shift-register synthesis approach. It is mentioned in [2] and 
[3], that decoding of interleaved Reed-Solomon codes can 
be performed on the basis of a multi-sequence shift-register 
synthesis algorithm. Such an algorithm is described by Feng 
and Tzeng in [18] and [19]. For homogeneous IRS codes, 
this algorithm provides an effective method for collaborative 
decoding. However, it is shown in [20] and [21] that the 
Feng-Tzeng algorithm does not always work correctly for 
sequences of varying length. Thus, it cannot be applied for 
the heterogeneous case, since the different dimensions of the 
codes A^\ A^ 2 \ . . . , A^ result in syndrome sequences of 
different lengths. Hence, we use the multi-sequence shift- 
register synthesis algorithm proposed in [20] to decode both, 
homogeneous and heterogeneous IRS codes. The complexity 
of this shift-register based approach is similar to the complex- 
ity of applying the Berlekamp-Massey algorithm to all words 
of the interleaved Reed-Solomon code independently. 

In the following sections, we describe the basic principles 
behind collaborative IRS decoding. Based on this, we derive 
the maximum correction radius. Furthermore, we discuss how 
the code properties and the decoding performance is influenced 
by the IRS code design, i.e., by the choice of the parameters 
of A 1 - \A-- 2 \ . . . ,A"\ Moreover, we derive bounds on the 
failure and error probability, which allow us to estimate the 
performance gain, which is achievable by collaborative decod- 
ing. On the basis of these results, we investigate concatenated 
code designs with outer IRS codes and inner block codes, 
and derive bounds on the overall decoding performance. These 
considerations are supplemented by Monte Carlo simulations, 
to assess the quality of our bounds. 

II. Interleaved Reed-Solomon Codes 

As already mentioned, the codewords of an IRS code are 
matrices whose rows are the codewords of the Reed-Solomon 
codes A (1 \A( 2 \ . . . ,A {1) . Before we formally define IRS 
codes, we briefly introduce classical Reed-Solomon codes, 
basically to establish the notation we use in the following. 

Reed-Solomon codes can conveniently be defined in the 
frequency domain, i.e., by the Fourier Transform of their code- 
words. For this purpose, we consider the following definition 
of the Discrete Fourier Transform over finite fields: 
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Definition 1 (Discrete Fourier Transform (DFT) over ¥ q ) 

Let p(x) = po + Pix + • • • + Pn-ix 11 ^ 1 be a polynomial of 
degree dcg (p(x)) < n — 1 with coefficients from ¥ q . Further, 
let a £ ¥ q be some element of order n. Then, the polynomial 

P(x) = & (p(x)) =P + P lX + ... + P n ^x n - X 

whose coefficients are calculated by Pi = p(a l ) is called the 
Discrete Fourier Transform ofp(x) over the field ¥ q . 

The inverse of the Discrete Fourier Transform pursuant to 
Definition 1 can be calculated in the well-known way as 
follows: Let P(x) be a polynomial of deg(P(x)) < n — 1, 
and denote by 

p(x) = (P{X)) = po + Pl X + ■ ■ ■ + Pn-lX™- 1 

the Inverse Discrete Fourier Transform. Then, the coefficients 
Pi of p(x) are calculated by pi = n~ 1 P(a~ l ). 

In the terminology of the Fourier Transform, we formally 
call p(x) time domain polynomial and P(x) the frequency 
domain polynomial or the spectrum of p(x). We now describe 
Reed-Solomon codes based on the Discrete Fourier Trans- 
form. The following definition does not describe the class of 
Reed-Solomon codes in its most general way. However, with 
regard to a concise notation, it is adequate for our purposes: 

Definition 2 (Reed-Solomon (RS) code) 

Let 

{A{x)} = ^Y^A iX \ A 4 eF 9 j 



be the set of all polynomials of deg (A(x)) < K with coeffi- 
cients Ai from ¥ q , and let a £ ¥ q be some element of order 
N < q. Then, a Reed-Solomon code A = US (q; N, K, D) 
of length N , dimension K, and minimum Hamming distance 
D = N — K + 1 can be defined as the set of polynomials 

A 4 { a (x) = &~ x (A(x)) | A(x) £ {A(x)}} . 

The codewords of A are represented by the polynomials a(x) = 
ao + a\x + • • • + a„_i.T n_1 , or alternatively by the n-tuples 
a = (a , ai, . . . , a„-i). 

Reed-Solomon codes fulfill the Singleton Bound with equality, 
and are therefore Maximum Distance Separable (MDS). This 
means that from any set of K correct symbols, a Reed- 
Solomon codeword can uniquely be reconstructed. 

An interleaved Reed-Solomon code is now obtained by 
taking I Reed-Solomon codes according to Definition 2 and 
grouping them row-wise into a matrix: 

Definition 3 (Interleaved Reed-Solomon (IRS) code) 

LetA^ = nS(q;N,K^ l \D^), I = I,..., I, be I Reed- 
Solomon codes of length N according to Definition 2. Then, an 
interleaved Reed-Solomon code is the set of matrices 
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If all I Reed-Solomon codes are equivalent, i.e., A^ = A^ = 

■ ■ ■ = A^ = A, the IRS code is called homogeneous. 
Otherwise, we say that the IRS code is heterogeneous. 

Instead of considering an IRS codeword as I x N matrix 
A with elements from the extension field ¥ q , we can also 
consider it as a row vector a with elements from the extension 
field ¥ q i. In this extension field representation, an IRS code 
is a code of length N, cardinality Y\[ =1 <z fc< and minimum 
distance mirii<£< ; over ¥ q i. Hence, for the 

homogeneous case, i.e., if A™> = 1ZS (q; N, K, D), £ = 
1, . . . , I, we obtain a code of length N, dimension K, and 
minimum distance D = N — K + 1 over F^i . This means 
that homogeneous IRS codes fulfill the Singleton Bound with 
equality and are therefore MDS. However, since the minimum 
distance of a heterogeneous IRS code is determined by the 
weakest Reed-Solomon code, i.e., the Reed-Solomon code 
with the smallest minimum distance, heterogeneous IRS codes 
are generally not MDS. 

III. Collaborative Decoding of Interleaved 
Reed-Solomon Codes 

Before we explain the concept of collaborative interleaved 
Reed-Solomon decoding, we consider the conventional case of 
a single Reed-Solomon code A = 1ZS (q; N, K, D). Decoding 
is usually performed by an algebraic Bounded Minimum Dis- 
tance (BMD) decoder, which corrects all errors with weights 
up to half the minimum code distance. 

A. BMD Decoding up to Half the Minimum Distance 

Assume that a codeword a(x) £ A is transmitted over 
a noisy channel, which adds an error polynomial e(x) = 
e + e\x + • • • + eN-ix N ~ 1 over ¥ q , so that we observe 
the word r(x) = a(x) + e(x) at the output of the channel. If 
e(x) = 0, the Discrete Fourier Transform R(x) = & (r(x)) = 

Ro + R\x H h Rn-ix 1 ^^ 1 will be a polynomial of degree 

smaller than K due to Definition 2, i.e., the coefficients 
Rk, ■ ■ ■ , Rn-i will be zero. However, if e(x) ^ A, we obtain 
deg(i?(x)) > K, which means that some of the coefficients 
Rk, ■ ■ ■ , Rn-i are non-zero. These coefficients only depend 
on the transformed error E(x) = & (e(x)) — Eq + E\x + 

■ ■ ■ + En-ix 1 * 1 ^ 1 . Hence, we can consider the coefficients 
Rk, ■ ■ ■ , Rn-i as syndrome coefficients and denote them by 
Sj = R K+j = E K+j , j = 0,...,N-K-l. 

To correct t errors, the standard approach for algebraic 
Reed-Solomon decoding is to define the polynomial \(x) = 
Ao + Aix + - • ■ + \n-ix n ~ 1 , such that it has a zero coefficient 
Xj = 0, whenever the corresponding coefficient e 3 of the 
error polynomial e(x) is non-zero. For all error free positions, 
i.e., for all positions for which ej = 0, the corresponding 
coefficient Xj is defined to be non-zero. Consequently, Xj-ej = 
holds for all positions j — 0, . . . ,N— 1. Due to the properties 
of the Discrete Fourier Transform, this relation is transformed 
into 

A(x) ■ E(x) = mod x N - 1 . (1) 

The spectrum A(x) is a polynomial of degree t, whose roots 
oT^ 1 , oT^ , . . . , aT correspond to the locations ji, j 2 , ■ ■ ■ , jt 
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of the erroneous symbols. Therefore, A(x) = Ao + Aix + 
■ ■ • + A t x* is called error locator polynomial. An error locator 
polynomial was first applied by Peterson [22] for decoding 
BCH codes. 

Since the roots of the error locator polynomial are not 
modified by multiplying A(x) by a constant factor, and since 
Ao 7^ 0, A(x) can always be normalized w.l.o.g. in such 
a way that A = 1. Equation (1) forms a linear system of 
N equations. In this system, t equations only depend on the 
M = N — K known syndrome coefficients So, ... , Sm-i and 
the unknown coefficients Ai, . . . , A t . With these t equations, 
we write the matrix equation 
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which is a linear system of N — K — t equations and t 
unknowns. Hence, (2) cannot have a unique solution, if t > 
N ~ K and we are never able to correct more than [ N ~ K j 
errors. If t < L^^J , solving (2) yields a unique error locator 
polynomial, and hence also the locations of the erroneous 
symbols. Basically, (2) can be solved by standard methods 
from linear algebra. This approach, also known as Peterson 
Algorithm, is described in [23]. However, today the Peterson 
Algorithm is of minor practical relevance since there exist 
algorithms which utilize the structure of (2) to solve the system 
of equations in a more efficient way. 

To understand how the structure of (2) can be exploited, we 
observe that we are able to state (2) in the form of the linear 
recursion 

t 

Si = -^2AjSi-j, i = t,...,N-KW-l (3) 

3=1 

of length t. In this way, the problem of calculating A(x) is 
transformed to the problem of finding the smallest integer t and 
the connection weights Ai, . . . , A t for recursively generating 
the syndrome sequence S = {Sj}^ -1 . This problem is 
equivalent to the problem of determining the shortest possible 
linear feedback shift-register, which is capable of generating 
the syndrome sequence S. This single-sequence shift-register 
synthesis problem is solved very efficiently by the Berlekamp- 
Massey algorithm [16], [17]. 

When A(x) is determined, and hence the erroneous posi- 
tions are known, the most difficult part of decoding is accom- 
plished. If the error locations are known, the error values can 
always be uniquely determined, as long as t < N — K. Error 
evaluation can be performed using several standard techniques 
like Recursive Extension [24] or the Forney algorithm [25]. 

B. Collaborative Error Location 

Now, we consider an interleaved Reed-Solomon code pur- 
suant to Definition 3. We assume that we observe a received 
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word 



R 



/ r (i)\ 

r (2) 

y r (0 / 



/ a(1) \ 

a( 2 ) 



e (2) 



where the £th row of R consists of the coefficients of the re- 
ceived word (x) = a^) (x)+e^ (x). Each error polynomial 
e {t \x) has < t non-zero coefficients located at the indices 
{i 1 ,i 2 ,...,i t (i)} C {ji,j 2 ,---,jt}- Hence, R = A + E 
can be written as sum of a codeword A e A. and an error 
matrix E with exactly t non-zero columns. For each row in 
R, we are able to calculate a syndrome sequence = 
{^f Ito^ _1 - However, since we allow the codes A^ 
to have different dimensions the calculated syndrome 

sequences may be of different lengths. Nevertheless, as long 
as t < N - KV\ we are able to create the (N - -t)xt 
matrix 



S W 



( 



s\ 
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\°N-K(t)-t-l 
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°N-KW-t 



and a vector 
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\ D N-K(V-1/ 

for each sequence I — We use the matrices 



S (1) ,S (2) ,...,S (0 , and the vectors T (1) ,T (2) , 
state the linear system of equations 



-,(0 
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V Ax J \ T «)J 



Si A T, 

with i unknowns. Note that if t = N — for some Reed- 
Solomon code A^\ we are still able to state a system of 
equations similar to (4), simply by skipping the corresponding 
matrix and the corresponding vector Moreover, it 
is senseless to consider the case t > N — since we are 

not able to uniquely reconstruct the Reed-Solomon codeword 
a^\x) from less than uncorrupted symbols. 

C. Error Location by Multi-Sequence Shift-Register Synthesis 

The basic structure of (4) is similar to the structure of (2). 
Thus, similar as before, we create the set of linear recursions 

S^ = -Y j K j s\%i = t,...,N-K^-l, £ = l,...,l, 



of length t. All I linear recursions use the same connection 
weights Ai, . . . , A t to combine the syndrome coefficients of 
the I received words. Hence, the error locator polynomial 
A(x) can be calculated by finding the smallest integer t and 
the connection weights Ai, . . . , A t for recursively generating 
all I different syndrome sequences = {Si}^, K _1 . 

This is equivalent to synthesizing the shortest linear feedback 
shift-register capable of generating the I syndrome sequences 
S^ 1 ' , . . . ,S"\ For a homogeneous Reed-Solomon code, i.e., 
if all I sequences have the same length, this multi-sequence 
shift-register problem can be solved by the Feng-Tzeng al- 
gorithm [18], [19]. However, for a heterogeneous IRS code, 
the sequences may be of different length. It is 

demonstrated in [20] and [21] that the Feng-Tzeng algorithm 
does not always yield the shortest shift-register for varying 
length sequences. Hence, we apply the varying length shift- 
register synthesis algorithm proposed in [20] to calculate 
A(x) for both homogeneous and heterogeneous IRS codes. 
A pseudo-code description of this algorithm is given by 
Algorithm 1. 

Algorithm 1: Shift-Register Synthesis Algorithm from [20] 
input: SW={S^}? = - K(e) -\e=l,...,l 

M <- maxi<^< ; {N - K^} 

MW<-N-KW,£=l,...,l 
t <- 0, A(x) <- 1 

m W <— M — M <- 0, for £ = 1, . . . , I 
AW(x) <- 0, A< £ ) <- 1, for £= 
for each m from to M — 1 do 
for each I from 1 to I do 

\im-t> M - AfW then 

(t) a cW 



4-V 1 A ■ <? w 



'I 

if A ^ then 

if m - to^ < t - then 

L A(x)^A(x)-^yAW(x)x 

else 

t*-t, A(x) <- A(x) 
A(x)^A(x)-^){x)x 
t ^m-(mW-tW) 
t W <_£ AW(i)*-A(i) 
A^ <- A, m< £ ) <- m 



output: t, A(x) 



Applying Algorithm 1 to the I syndromes S^\...,S^ 
yields a polynomial A(x) and a shift register length t. How- 
ever, by the definition of the error locator polynomial, A(x) 
is only a valid error locator polynomial, if it has exactly t 
distinct roots. Hence, we accept a A-polynomial obtained from 
Algorithm 1 only, if it conforms to the following definition: 



Definition 4 (t-valid A-polynomial) 

A polynomial A(x) over ¥ q is called t-valid, if it is a polyno- 
(5) miai of degree t and possesses exactly t distinct roots in ¥ q . 
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Once a i-valid error locator polynomial A(x) is successfully 
calculated for an IRS code, error evaluation can be performed 
independently for all / Reed-Solomon codewords by the 
standard techniques also used for classical BMD decoding. 

Using Algorithm 1 for calculating A(x) yields the same 
computational complexity as locating the errors in the I code- 
words independently by the Berlekamp-Massey algorithm. 
Consequently, we are able to increase the error correction 
radius above half the minimum distance of the Reed-Solomon 
code with the smallest distance, without increasing the decod- 
ing complexity. 

Altogether, our collaborative decoding strategy for IRS 
codes consists of four steps. First, we calculate I syndromes 
<SW , . . . , S( l \ next, we synthesize an error locator polynomial 
A(x), then we check whether this polynomial is t-valid, and 
whether t is smaller than some maximum error correcting 
radius i max , which will be specified later. If we have a t- 
valid A-polynomial, we calculate values of the errors e^\x) 
independently for all received words (x), and obtain esti- 
mates 

a,W(x) =rW(x)-eW(x) 

for I — 1, . . . , I. If A(x) is not i-valid, we get a decoding 
failure. The complete decoding algorithm is summarized by 
Algorithm 2. 



Algorithm 2: Collaborative IRS Decoder 



input: received word R = I ; 

V r ('o / 

use DFT to calculate syndromes ,.. ., 
synthesize t, A(x) by Algorithm 1 
if t < i max and A(x) is t-valid then 
for each I from 1 to I do 

evaluate errors, and calculate 
L calculate = — 

else 

|_ decoding failure 
output: A = I e A. or decoding failure 



Generally, depending on the errors added by the channel, 
Algorithm 2 may yield three different results: 

1) The algorithm may obtain a correct result, i.e., (x) = 

aW{x)W= 

2) The algorithm may obtain an erroneous result, i.e., 3£ S 
[I,..., I] :aW(x)^aW(4 

3) The algorithm may not yield a result at all, i.e., it may 
yield a decoding failure. 

All three events occur with a certain probability. In the 
following, we denote the probability for a correct decision by 
P c , the probability for an erroneous decision by P e , and the 
probability for a decoding failure by Pf. Hence, the probability 
P w for obtaining a wrong decoding result is calculated by 

P w = l-P c = Pe+Pf- 
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D. Joint Error and Erasure Correction 

In [11], an algorithm for joint error and erasure decoding 
of IRS codes is proposed. Like the algorithm presented in 
[4] for decoding errors only, the algorithm from [11] uses a 
Welch-Be rlekamp approach, which is based on solving a linear 
system of equations. We briefly explain how Algorithm 2 can 
be modified to allow joint error and erasure correction based 
on shift-register synthesis. 

Technically, erasures occur if the receiver is not able to 
detect any meaningful symbol at the output of the channel. 
This means that the decoder does not know the value, but the 
position of an erased symbol. Formally, we define an erasure 
as follows: Let £ denote an erased symbol, and define it to be 
a special symbol £ ^ ¥ g . Moreover, let the addition of a € V q 
and £ be defined by a + £ = £ + a = £. Now, assume that we 
observe a received word 

R= A + E + S , 

where the rows of the matrix 




w 

are the coefficients of the erasure polynomials e^\x) = e<p + 
e[ e) x + ■■■ + efL^- 1 over {0, £}, I = 1, . . . , I. Each non- 
zero element in £ indicates an erased symbol. In contrast to 
errors, the erased symbols are detectable in the received word 
R, which means that they do not have to be located, only 
their values have to be determined. For this reason, we do not 
require the erasures to occur in column bursts. 

To perform joint error and erasure decoding for IRS codes, 
we proceed like described in [26] and define an erasure locator 
polynomial 

¥ e \x) 4 (l - A) (l - A) . . . (i _ Jlx) = 

= 1 + + ^ ] x 2 + ■■■ + *gV< 

for each of the rows I = 1, in our IRS code. The 
roots a 4 i , a 4 2 , . . . , a ^ indicate the positions of the 
erasures in the erasure polynomial e^ e \x), i.e., the coefficients 
of e^\x) at the positions i^\4^ ■ • • > ^ e q ua l to £, and 
all other coefficients are zero. Now we consider the received 
polynomial r^\x) whose coefficients are the £th row of R. 
From r^\x) we create the polynomial f^\x) by replacing 
all erasure elements by an arbitrary field element. Then, we 
calculate R^ l \x) = & (f^(x)) and RW(x) = R^{x)^{x). 
It is explained in [26] that R {t \x) = k[ l) + R[ e) x + ■■■ + 
R^-iX 71 ^ 1 is the spectrum of a modified received word (x) 
with errors at the same positions as r^\x). Moreover, the last 
N—K^—'di coefficients of R^(x) are only influenced by the 
errors, so that we are able to obtain the modified syndromes 
SW = m^- 1 -*^ where sf> = ft$ w ^ i = 
0, . . . , N - - & t - 1, I = 1, . . . , I. This means that 
each erasure symbol in the row I reduces the length of the 
usable syndrome by one. In this way, we obtain I (varying 
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length) syndrome sequences, which we feed into Algorithm 1 
to calculate a pair t, A(x). Once, a t-valid polynomial A(x) 
is calculated and t < t max , we proceed like described in 
[26] to obtain the error and erasure values independently for 
each row, using the error and erasure locator polynomial 

In some sense, error and erasure decoding of an IRS code 
is equivalent to error only decoding of a heterogeneous IRS 
code, since in both cases we process varying length syndrome 
sequences. For this reason, we do not further consider the error 
and erasure case explicitly, but keep in mind that it is closely 
related to the case of heterogeneous IRS codes. 

IV. Error Correcting Radius 

If the number of errors t is smaller than half the mini- 
mum distance of the Reed-Solomon code with the smallest 
redundancy, we are always able to correct all errors by solving 
(3) separately for all I Reed-Solomon words using standard 
techniques like the Berlekamp-Massey algorithm. Certainly, it 
is important that Algorithm 2 is also capable of decoding all 
errors in this case. To prove that Algorithm 2 actually has this 
property, we first consider the following lemma: 



Lemma 1 Let v, and w be two vectors in F™, such that the 
Hamming weights of v and w satisfy wt (v) < q — 1 or 
wt (w) < q — 1. Then, there always exists an element £ ¥ q , 
7^ 0, such that the support of v + 0w satisfies 



supp (v + 0w) = supp (v) U supp (w) . 



(6) 



Proof: Since wt (v) < q—1, or wt (w) < q—l the cardi- 
nality of the intersection V = supp (v) Hsupp (w) of the sup- 
ports of v = (vo, v\, . . . , vn-i) and w = (w , »i, • ■ • , iUff-i) 
is at most \T>\ = min{wt (v) , wt (w)} < q— 1. To obtain (6), 
Vi + fiwi ^ has to be satisfied for all i e V. In other words, 



Wi 



(7) 



Equation (7) can always be satisfied choosing a suitable 0, 
since \V\ < q — 1, and therefore there exists at least one 
^ 0, which fulfills (7). ■ 
Now, we consider a received word R = A + E, where A 
is a codeword of an IRS code and E is an error matrix. Since 
the syndromes . . . ,<S^ calculated by Algorithm 2 only 
depend on E and not on A, we can assume w.l.o.g. the case 
A = to analyze Algorithm 2. In other words, we are able 
to apply Algorithm 2 directly to the error matrix E instead of 
the received word R. 

Lemma 2 Assume that E is an I x N error matrix and L is 
a non-singular I x I matrix over ¥ q . Moreover, let E = LE 



Proof: Assume that Algorithm 2 is applied to the error 
matrix , , 



E = 



= (2) 



In its fist step, Algorithm 2 computes the Discrete Fourier 
Transform for every row of E: 

(Eg* e<P . 



/^( e(1) )\ 



E, 



(2) pP) 







P (2) 



(I) 



The last N — elements of the l\h row coincide with the 
elements of the syndrome sequence 5^. Hence, we formally 
define the syndrome matrix 



S(E) 



£ E (2) 



E 



(1) 



E 



(2) 

K( 2 ) + l 



.(1) 

KW+1 



(2) 
K< 2 >+2 



j(l) 
J N- 

E {2) 



Em^^ 



E 



? (2) g(2) 



(1) 



,(2) 



by replacing the first elements of the £th row by erasure 
symbols, i.e., by symbols £ ^ ¥ q , which fulfill a+£ = £,+a = 
£, and a ■ £ = £ • a = £ for an arbitrary element a e ¥ q . 
In other words, S(E) is an I x N matrix containing the I 
syndrome sequences aligned to the right, and prepended by 
erasure symbols. From the linearity of the Discrete Fourier 
Transform we know that 



(0 
Km 

S 

S 



p(0 / 



(1) 

N-KW-l 
(2) 



& {LE) = L& (E) 



and consequently also S(E) = LS(E). 

Since (5) is a linear recursion, we know (see e.g. [27]) that 
if the rows of S(E) satisfy (5) with some pair t, A(x), then 
the rows of LS(E) also satisfy (5) with the same pair t, 
A(x). Since L is a non-singular matrix, E = L^E, and we 
conclude that S(E) satisfies (5) with the pair t, A(x), if and 
only if S(E) satisfies (5) with the same pair t, A(x). Thus, 
if Algorithm 2 has a unique solution t, A(x) for JZ(E), then 
Algorithm 2 has the same unique solution for S(E), and vice 
versa. ■ 

Theorem 1 (Guaranteed Correcting Radius) Consider an 
interleaved Reed-Solomon code pursuant to Definition 3. 
Assume that this code is corrupted by an error matrix E with 
t non-zero columns. Then, Algorithm 2 always yields a unique 
and correct solution, i.e., all t column errors can be corrected, 
as long as t satisfies 



N -K 



be another error matrix, whose rows are linear combinations 
of the rows of E. Then, Algorithm 2 applied to E yields a 

unique pair t, A(x), if and only if Algorithm 2 applied to E t <t = — — (8) 

yields the same unique pair t, A{x). 
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where 



K = max [k^X 

Kt<l I J 



is the maximum dimension among the I Reed-Solomon codes 

4(1) 4(2) /lW 
- , . . . , «/i 

Proof: Assume that a codeword of the interleaved Reed- 
Solomon code is corrupted by an error matrix E with t non- 
zero columns. We distinguish two different cases: 

1) If wt(eW) = |supp(eW)| = t, then the Berlekamp- 
Massey algorithm applied to the sequence yields a pair 
t, A(x) which is the shortest length solution of (3), provided 
that t satisfies (8). Since we consider column errors, the pair 
t, A(x) is also a solution of (3) for the remaining sequences 
<S( 2 ), . . . ,<S^. In other words, t, A(x) is the shortest length 
solution of (5), which is also obtained by applying Algorithm 1 
to the I sequences S^ 2 \ . . . , . Consequently, Algo- 
rithm 2 yields the correct solution. 

2) Now, we consider the case wt (e^ 1 )) = |supp(eW)| < 
t, i.e., the case that the column errors only affect < t 
symbols in Since 



t < 



N -K 



<q-l 



is always satisfied due to Definition 2, we are able to recur- 
sively apply Lemma 1 to obtain I coefficients 0i, fe, ■ ■ ■ , Pi, 
(3i = 1, such that the linear combination 



is an error vector with t non-zero coefficients, i.e., we have 



supp 



(*->) 



= t. Hence, we define the non-singular matrix 



L = 



1 



\0 







1/ 



and use it to calculate the error matrix E = LE, which has t 
non-zero error symbols in its first row. Thus, we know from 
case 1) that Algorithm 1 synthesizes the correct pair t, A(x) 
for E, and that Algorithm 2 consequently yields the correct 
solution for the matrix E. Due to Lemma 2, we know that 
Algorithm 2 also yields the correct solution for the error matrix 
E, which proves Theorem 1 for the case 2). 

■ 

Beyond the guaranteed error correcting radius we may be 
able to locate the errors, as long as the number of unknowns 
in (4) is not larger than the number of equations, i.e., as long 

as 



t < 



I 



l + l 



N - 



(9) 



is satisfied. Moreover, we are only able to evaluate the errors, 
if the number of errors does not exceed the number of 
redundancy symbols in the component code with the largest 
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dimension. This gives rise to the following theorem on the 
maximum error correcting radius: 

Theorem 2 (Maximum Correcting Radius) Consider an 
interleaved Reed-Solomon code pursuant to Definition 3. 
Assume that a word of this code is corrupted by t column 
errors. Then, Algorithm 2 may only find a unique and correct 
solution, if t satisfies 



t<t„ 



mm 



where 



(10) 



is the average dimension of the I Reed-Solomon codes. 

Proof: To find a unique and correct solution, the linear 
system of equations (4) with t unknowns and the set of linear 
recursions (5) of length t must have a unique solution, which 
means that rank (Si) = t. This is only possible, if (9) is 
satisfied, since otherwise the matrix Si has less than t rows. 
Hence, in order to be able to locate the errors, t must not be 
larger than the first argument in (10). Even if the locations of 
the errors are known, the error values can only be calculated 
if t is not larger than the number of redundancy symbols in 
every Reed-Solomon code. Thus, if t > N — K, it is not 
possible to evaluate the errors in at least one Reed-Solomon 
codeword. Consequently, t must also not be larger than the 
second argument in (10). ■ 
The Theorems 1 and 2 characterize the collaborative de- 
coding approach described by the linear system of equations 

(4) or equivalently by the set of linear recursions (5). If the 
number of column errors satisfy (8), i.e., if the number of 
errors is smaller than half the minimum distance of the Reed- 
Solomon code with the largest dimension, (4) and hence also 

(5) have a unique solution, and we are always able to correctly 
reconstruct the transmitted codeword A. If the number of 
errors lies between the guaranteed error correcting radius 
specified by (8) and the maximum correcting radius specified 
by (10), (4) and (5) may still have a unique solution, which 
enables us to correct errors beyond half the minimum distance 
of the component codes. In this sense, our collaborative 
decoding approach yields a Bounded Distance (BD) decoder, 
whose maximum error correction radius lies beyond half the 
minimum distance. However, as soon as the number of column 
errors exceeds the limit specified by (8), we are not able any 
more to guarantee that (4) and hence also (5) has a unique 
solution. In fact, the solutions of (4) and (5) will be ambiguous 
with some probability Pf > 0, since rank (Si) < t for some 
error patterns. 

The possibility to obtain an ambiguous solution is inherent 
to Bounded Distance decoders which correct errors within a 
radius beyond half the minimum distance, since some correct- 
ing spheres are inevitably overlapping in this case. Basically, 
three strategies are conceivable to cope with this problem: 

1) The decoder selects on single codeword out of all valid 
decoding results 
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2) The decoder returns a list of all valid decoding results 
that is, a list of all codewords which lie inside a sphere 
with radius i max around the received word y. 

3) If the decoding result is not unique, i.e., if there exists 
more than one valid decoding result, the decoder yields 
a decoding failure 

Since finding all valid solutions is algebraically difficult with 
our approach and usually requires a high computational effort, 
the decoders we consider here apply the third strategy: when- 
ever (4) does not have a unique solution, Algorithm 2 may 
yield a decoding failure (we formalize this statement later in 
Lemma 5). 

V. Distribution of the Redundancy 

Definition 3 does not restrict the choice of the dimensions of 
the I Reed-Solomon codes used to construct a heterogeneous 
IRS code. However, to ensure a good performance under 
collaborative decoding, we should apply some restrictions on 
the dimensions . . . , K^ l \ 

From (10) we observe that the maximum error correction 
radius depends on the average redundancy N — K and the 
minimum redundancy N — K of the components. If the 
correction radius is limited by the second restriction, our 
collaborative decoding strategy is not able to recover all errors, 
even if we are able to locate them. Hence, the redundancy 
should always be distributed in such a way that we are able to 
evaluate all errors if we manage to locate them. This is stated 
by the following theorem: 

Theorem 3 (Redundancy Distribution) Consider an inter- 
leaved Reed-Solomon code pursuant to Definition 3. For 
a given code rate R = j|, the error correction radius 
is maximized, if the dimension of the I component codes 
A^\ . . . ,A^ are chosen such that they satisfy 



K < 



i + i V i 



N 



K 



(11) 



Proof: For a given rate R = §, jj^ (N - K) is 
constant in terms of the choice of the dimensions in the 
codes A^\ . . . ,A^ l \ Hence, the error correcting radius is 
maximized for 

— — (N-K) <N-K. 



Solving for K yields 



K < 



1 



i + 1 



-N 



l + l 



-K = 



1 + lKl 



1 + K 



Clearly, there are not much reasons to consider IRS codes 
which do not fulfill (11). Therefore, we always assume in 
the following that IRS codes are designed in accordance to 
Theorem 3. 

By examining (11) we observe that asymptotically, i.e., 
for I — ► oo, the maximum dimension is limited by the 
average dimension. This means that for IRS code designs 
with a large I, the redundancy should be distributed equally 
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over all component codes. Though, if I is small enough, the 
redundancy may be distributed unequally, as long as we only 
care about the maximum error correcting radius, and not about 
the minimum distance of the IRS code. However, the minimum 
distance is only optimal in the homogeneous case. This is 
explained by the fact that homogeneous IRS codes are MDS, 
while the minimum distance of heterogeneous IRS codes 
is determined by the Reed-Solomon code with the largest 
dimension. Nevertheless, heterogeneous IRS codes can be 
interesting e.g. in the context of generalized concatenated code 
constructions introduced and described by Blokh and Zyablov 
[13] and Zinoviev [14], for decoding single Reed-Solomon 
codes beyond half the minimum distance like described in 
[15], and for several other applications. 

VI. Collaborative Decoding Performance 

To evaluate the performance of our decoding strategy, we 
would like to estimate the error probability P e , and the failure 
probability Pf, which together result in the probability P w 
for obtaining a wrong decoding result. We are particularly 
interested in the failure probability Pf (t) in the range t g < t < 
tmax, since the proposed decoding strategy certainly makes 
only sense if Pf (t) is small enough, such that the decoder only 
fails in a very few cases as long as t is below the maximum 
error correcting radius i max . 

A. Maximum Likelihood Certificate Property 

Before we derive upper bounds on Pf and P e , we consider 
another interesting property of our collaborative decoding 
strategy: we show that Algorithm 2 exhibits the Maximum 
Likelihood (ML) certificate property. This means that whenever 
the decoder does not fail, then it yields the ML solution, 
i.e., the solution with the minimum Hamming distance to the 
received word. This property will help us later, to overbound 

Pe- 

Definition 5 (ML Certificate) 

Consider a code C, and assume that the word y is received, 
when a codeword c e C is transmitted over a memoryless noisy 
channel. Moreover, consider a decoding algorithm which either 
decodes a codeword c € C or yields a decoding failure. We say 
that the decoding algorithm exhibits the ML certificate property, 
if whenever the decoder decides on a codeword c e C, there 
does not exist another codeword c' E C, which has a smaller 
Hamming distance to y than c. 

To show that Algorithm 2 exhibits the ML certificate 
property, we state the following lemma, which immediately 
follows from the definitions of the error locator polynomial 
and the syndromes in Section III: 

Lemma 3 Consider a codeword A G A. of an IRS code 
according to Definition 3. Assume that this word is corrupted 
by an error matrix E with t non-zero columns at the positions 
3\i 3ii ■■■ i3t- Hence, we observe a matrix R = A + E 
with I rows, from which we are able to compute the syn- 
dromes ,S^ 2 \ . . . ,S( l \ Then, the error locator polyno- 
mial A(x) = nj=i (l — ct~^x) is a t-valid polynomial, which 
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is a solution of the system of equations (4) with t unknowns, 
and of the set of linear recursions (5) of length t. 

Lemma 3 is helpful for proving the following theorem: 

Theorem 4 (ML Certificate of Algorithm 2) The decoder 
specified by Algorithm 2 exhibits the ML certificate property. 

Proof: If Algorithm 2 does not fail while decoding R, 
it yields a pair t, A(x), where A(x) is a i-valid polynomial 
satisfying the system of equations (4) with t unknowns, and 
the set of linear recursions (5) of length t. From A(x), the 
decoder computes an error word E with t non-zero columns, 
and a codeword A = R E which differs in t columns from 
R. Hence, the extension field representations a, and r of the 
matrices A and R fulfill d (a, r) = t. 

Now, we assume that there exists a codeword A' whose 
extension field representation a' has a smaller Hamming 
distance to r than a, i.e., 

d(a',r) =t' <t. 

Applying Lemma 3 to R and A' yields that there exists a 
t'-valid solution A' (a;) of the system of equations (4) with t' 
unknowns and the set of linear recursions (5) of length t' , 
where t' < t. However, since Algorithm 1 always finds a 
solution of (5) with the smallest t, this yields a contradiction 
to the assumption that dist(a',r) < dist (a, r). Hence, if 
Algorithm 2 finds a solution, this solution corresponds to the 
codeword with the smallest Hamming distance to R. ■ 

B. Error Probability 

Now, we derive an upper bound on P e for the general 
case that a linear block code is decoded by a BD decoder 
which exhibits the ML certificate property. For this purpose, 
we assume that the transmitted word is corrupted by t errors, 
and that the BD decoder is able to correct errors up to the 
radius i max . We consider a pair of codewords, and count 
the number of received vectors which may be decoded into 
wrong codewords. Then, we use a union bounding technique 
to overbound the error probability P e {t). The bound obtained 
in this way is directly applicable to IRS codes by interpreting 
the IRS codewords as vectors over the extension field ¥ q i . 

Lemma 4 Consider two concentric spheres si and s 2 in the 
Hamming space . Assume that si has radius T\, and s 2 
has radius r 2 . Fix an arbitrary point v on the surface of s±, 
and let U(q, r 2 , r\, p) be the number of points on the surface 
of S2 with distance p to v. Then, r 2 , ri, p) is calculated 
by 



U(q,r 2 ,ri,p) 

ri+r 2 -p 

E 

i=r ri+ ?~ p i 



ri\f i \(N-n 

i J \p — (ri + r 2 ) + 2iJ \ r 2 — i 

_ *)\p-(ri+r 2 )+2i _ \^ r2 ~ i 



(9-2) 



Proof: For proving Lemma 4, we assume w.l.o.g. that 
the spheres si and s 2 are centered around the origin. Then, 
we consider a vector v on s\. Since si is centered around 
the origin, wt (v) = n, and hence v has exactly r\ non- 
zero components. Now, we consider another vector w on the 
surface of s 2 , i.e., wt (w) — r 2 , and assume that v and w are 
overlapping in exactly i non-zero coordinates. The Hamming 
distance d(v,w) is calculated by 

d (v, w) = (n + (r 2 + 5(i) , 

where S(i) is number of differing symbols in the overlapping 
coordinates. Then, we count how many vectors w exist, for 
which d(v,w) = p, or equivalently 

$(*) = p - (n + r 2 ) + 2t . (12) 

Hence, we have 

% \(n - 0)p-(ri+r2) + 2i 

vP - (ri + r 2 ) + 2iJ yq ' 
possibilities to choose non-zero symbols in the overlapping 
coordinates which are different in the two vectors, and 

N - ri \ q -ir- i 

r 2 -i J 

non-zero symbols in the non-overlapping coordinates of w, 
in order to obtain a vector with Hamming distance p to v. 
Consequently, to calculate U(q, r 2 , r\, p), we have to sum up 
the product 

i \( N ~ ri " 

K p - (n + r 2 ) + 2i)\r 2 -i 

■ (q-2) p - (ri+r2)+2l (q-l) r2 - 1 

for all possible numbers of overlapping coordinates. To obtain 
the summation limits, we observe that S(i) is limited by < 
5(i) < i. By inserting (12), we obtain 

(ri + r 2 - p)/2 < i < n + r 2 - p . 



Hence we have to sum up from [ 



T1+T2 — P 



] to n + r 2 - p. 



Consequently, U(q, r 2 , ri, p) is calculated by the sum 



ri+r 2 — p 

E 



i \{N — n 

p-(n+ r 2 ) + 2i)\r 2 -i 



(9-2) 



p-(ri+r 2 )+2i 



(«-!)' 



This proves Lemma 4. ■ 
Theorem 4 and Lemma 4 enable us to prove the following 
theorem, which overbounds the error probability P e (t): 

Theorem 5 (Error Probability) Let C{Q; N, K, D) be a lin- 
ear block code of length N, dimension K, and minimum 
distance D over the field Wq, decoded by a BD decoder which 
exhibits the ML certificate property. Assume that the decoding 
radius of this decoder is £ max , and that it decodes a received 
word, which is corrupted by t errors. Then, the probability for 
a decoding error is overbounded by 

Eltfr 4E3 tlW} -u(Q,t, w ,p) 



Pe(t) < Pe(t) 



(13) 
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where A w describes the weight distribution of the code, i.e., 
A w is the number of codewords of weight w. 

Proof: Consider two codewords c e C and c'eC with 
Hamming distance dist (c, c') = w to each other. Assume that 
the codeword c is corrupted by t errors, and that the word y 
is observed at the output of the channel. For this arrangement, 
we overbound the pairwise error probability P(c — > c'), i.e, 
the probability that c' is decoded under the condition that c 
has been transmitted. To do this, we consider a sphere s of 
radius t centered at c. We count all points on the surface of 
s, which lie inside a sphere s' of radius i max around c', since 
they may be decoded into c' (see Fig. 2). However, we only 
count the points on s', which have a maximum distance of t 
to c', since otherwise they are closer to c than to c', and due 
to the ML certificate property we know that they are correctly 
decoded into the codeword c. 

In other words, we count all points on s, whose distance p 
to c' is in the range < p < min{t,i max }. For this purpose, 
we consider the two concentric spheres si and s 2 around c. 
We select the sphere S\ to have radius w, and the sphere s 2 
to have radius t. This means that d is a point on S\, and 
s 2 coincides with the sphere s. Hence we are able to apply 
Lemma 4 to count the number of points N p on the surface of 
s 2 with distance p to c' on si, where < p < min {t, t max }. 
This is illustrated by Fig. 2. 




/"'■si 



Fig. 2. Number of points which may contribute to P(c — > c') 

In this way, we obtain 

min{i,i max } 

N p = U(Q,t,w,p). 

p=0 

To overbound the total number N t of points which are decoded 
into wrong codewords, we use a union bounding technique and 
consider all codewords within distance D < w < t + t max to 
c, since the sphere s does not overlap with the correcting 
spheres of all other codewords. As there are A w codewords 
with distance w to c, we obtain 

t+t max 

N t < J2 A " N p- < 14 > 

w=D 
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To calculate P e (t), we divide Nt by the number S n (t) of 
points on the sphere s with radius t, which is calculated by 

S n(t)={ N t ) (Q-l)*- 

Since N t is overbounded by (14), we obtain the statement of 
Theorem 5. ■ 
Note that if we set i max = t g in Theorem 5, (13) coincides 
with the error probability for classical BMD decoding as 
described in [24]. The expression is actually exact in this case, 
since the spheres around the surrounding codewords do not 
overlap in the case i max = t g . 

Theorem 5 basically holds for arbitrary linear block codes 
over the field ¥ q , which are decoded up to the maximum 
error correcting radius t max by a decoder exhibiting the ML 
certificate property. To apply this theorem to IRS codes, we 
interpret them as codes over the extension field ¥ q i, and set 
Q = q l . Due to Theorem 4, Algorithm 2 exhibits the ML 
certificate property, and hence (13) provides us with an upper 
bound on the error probability P e (t). However, to calculate 
(13), we require knowledge about the weight distribution of the 
code. A homogeneous IRS code can be interpreted as an MDS 
code over the extension field ¥ q i . Since the weight distribution 
of MDS codes is well-known (see e.g. [28]), the weights A w 
can be calculated for this case by 

where D = N — K + 1. Unfortunately, heterogeneous IRS 
codes are usually not MDS. Hence A w can generally not be 
obtained in such a simple way. However, even if the weight 
distribution is not known for a heterogeneous IRS code, we 
are able to overbound P e (t) on the basis of (13), by replacing 
A w by A w , where A w is such that A w > A w V w = 0, . . . , N. 

C. Failure Probability 

Collaborative decoding of IRS codes provides us with a 
method of decoding errors beyond half the minimum distance, 
if the errors affect the columns of the interleaved scheme like 
depicted in Fig. 1. If the number of errors is smaller than the 
guaranteed correcting radius (8), we know from Theorem 1 
that we are always able to correct these errors. Contrariwise, 
if the number of errors exceeds the maximum correcting radius 
(10), the decoder will always fail or take an erroneous decision. 

In order to analyze the probability Pf (t) in the range t g < 
t < tmax, we assume that for each column of the interleaved 
Reed-Solomon code, each error pattern occurs equiprobable. 
More precisely, we assume, that the burst errors 



are random vectors, uniformly distributed over Fg\{0}. Under 
this assumption, bounds on Pf have been derived in [4], and 
[5] for homogeneous IRS codes. However, since these bounds 
are rather weak or do not depend on the error weight t, they 
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generally do not yield a good estimation of the decoding 
performance. 

To obtain a bound for heterogeneous IRS codes, we gener- 
alize a bound from [10] for homogeneous IRS codes, which 
allows for estimating the failure probability Pf(t) in depen- 
dence of t. For proving this generalized bound, we use similar 
but simpler techniques, which have been applied in [2] to 
bound the failure probability for folded Reed-Solomon codes. 
These techniques are based upon analyzing the linear system 
of equations (4), and overbounding the number of cases, in 
which the system of equations (4) with t unknowns yields 
multiple solutions. To obtain an upper bound on the failure 
probability of Algorithm 2 from this analysis, we have to 
show, that whenever Algorithm 2 yields a decoding failure, 
then there exist multiple solutions for (4). 



Further, let 



W = 



{w , . 



be a I x ijj matrix, such that wj ^ 0, Vj = 0, . . . , u — 1, 
i.e., that W does not have any all-zero column. Furthermore, 
assume that all columns of W are uniformly distributed over 
all non-zero vectors of length I. Then, the probability P w that 



w 



MeJCW V*=1,...,Z 



is overbounded by 



W i) 



(15) 



(16) 



Lemma 5 Consider a codeword A G A. of an IRS code 
pursuant to Definition 3. Assume that this word is corrupted 
by an error matrix E with t non-zero columns, and that 
Algorithm 2 yields a decoding failure. Then, the linear system 
of equations (4) with t unknowns has multiple solutions. 

Proof: Assume that a codeword A e A has been 
transmitted and Y = A + E is received, where E has t non- 
zero columns. Algorithm 2 only yields a decoding failure, if 
Algorithm 1 computes a pair t', A' (a;), such that A' (a;) is not 
t'-valid. 

First we show that t' < t. Applying Lemma 3 to Y and A 
yields that there exists a t-valid polynomial A(a:), which is a 
solution of (5). Since Algorithm 1 always yields the smallest 
length shift-register which is a solution of (5), we know that 
t' < t. 

Second, A' (a;) is a solution of the set of linear recursions 
(5) of length t', and hence also for the system of equations (4) 
with t' unknowns. However, if A'(x) is a solution for the set of 
linear recursions (5) of length t', it is also a solution for length 
t > t', since if A' (a;) fulfills (5) for i > t', it obviously also 
fulfills it for i > t > t'. Hence, A' (a;) is not only a solution 
of the system of equations (4) with t' unknowns and the set 
of linear recursions (5) of length t', but also for the system 
(4) with t unknowns, and the set of recursions (5) of length 
t. Consequently, the system of equations (4) with t unknowns 
has at least the two solutions A(a;) and A' (a;). This proves the 
statement of Lemma 5. ■ 
Note that we only show that if Algorithm 2 yields a decoding 
failure, then (4) has multiple solutions. Contrariwise this does 
not necessarily mean that Algorithm 2 always fails if (4) has 
multiple solutions. However, since we are only interested in 
overbounding the failure probability Pf, Lemma 5 is sufficient 
for our purposes, since the number of events for which (4) has 
multiple solutions is an upper bound for the number of events 
in which Algorithm 2 fails. 

Hence, to overbound the probability Pf that Algorithm 2 
fails, we consider the following lemma: 



Proof: Let L be the set of all I x w matrices whose rows 
fulfill Equation (15). Further, let be the set of all I x u> 
matrices with elements from F g , and let the subset <S^ C 
be the set of matrices without any non-zero column. Then, the 
probability P w that a matrix W without any all-zero column 
fulfills (15) can be calculated by 

|£ns«| < \c\ 
\sz\ - \sz\ ' 



p., = 



The cardinality \C\ is obtained by 



\c\ = n ijcw 



(t) 



and the cardinality is calculated by 



IS" I 



1 



Consequently, P^ is overbounded by 



P., < 



q' 



Ijj 



W i) u 



- Y l o< 



Lemma 6 enables us, to state and prove the following 
theorem: 

Theorem 6 (Failure Probability) Consider an interleaved 
Reed-Solomon code pursuant to Definition 3, which is decoded 
by Algorithm 2. Assume that the codes . . . , are 

chosen such that (11) is satisfied. Furthermore assume that 
A is corrupted by t column errors, where each column vector 
is an independent random vector uniformly distributed over 
¥ l q \ {0}. Then, the probability for a decoding failure is 
overbounded by 



Pf(t) < Pf(t) 



- 1 



-(* + l)(tmax-t) 

9-1 



(17) 



where i max 
radius. 



j (N — K) is the maximum error correcting 



Lemma 6 Let IC^^JC^, . . . ,K,® be I q-ary linear codes of Proof: According to Lemma 5, the failure probability 

length uj, and let the dimension of the code JC^ be lu — of Algorithm 2 can be overbounded by considering the cases, 
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in which the system of equations (4) with t unknowns has 
multiple solutions. We have such a case whenever rank (Si) < 
t, i.e., whenever there exists a column vector w^O, such that 
Si ■ u = 0. Equivalently we can say that (4) cannot have a 
unique solution, if 



3u^0 : S W u = V£ = l, 



(18) 



Since the syndrome matrices S^\ S^, . . . , S^ directly de- 
pend on the error matrix E, we are able to express the failure 
probability Pf(t) in a general way by 

number of matrices E t satisfying (18) 



Pf(t) = 



total number of matrices Et 



where E t denotes an error matrix with exactly t non-zero 
columns. Now, we consider matrices with non-zero columns 
at fixed indices 31,32, ■ ■ ■ ,jt- More precisely, for a fixed 
set {31,32, ■■■ ,3t\ °f t indices, we consider the ensemble 
£t (ji, ■ ■ ■ ,jt) of matrices, in which every column with in- 
dex j E {ii)j2) ••• ,jt} is an independent random vector 
uniformly distributed over \ {0}, and all other columns 
are zero vectors. Then, the probability that (18) is satisfied for 
matrices E t from the ensemble £ t (ji, ■ ■ ■ ,jt) is calculated by 



Pf(h 



Jt) 



I {Ee£ t (ji,..., j t ) :E satisfies (18)}| 



\£t(ji,-..,jt)\ 

We will now derive an upper bound on Pf(ji, ■ ■ ■ , jt ), which 
does not depend on the selection of the indices 31,32, ■ ■ ■ ,jt, 
but only on the number of erroneous columns t. Hence, this 
bound will directly provide us with the upper bound on Pf(t), 
in which we are actually interested in. 

For calculating Pf(ji, ■ ■ ■ ,jt), let the number of rows in 
S w be denoted by g^\ i.e., gW = N-K^-t. It is known 
(cf. e.g. [24]) that a syndrome matrix S^ can be decomposed 
into 

S W = H W . F W DV. 



At this, the matrix, 
V = 



1 



a? 



a (*-iWi\ 



a 



(t-i)h 



\l a? 



v 2 Jt 



a 



(t-i)jt 



is a t x t Vandennonde matrix, the matrices 
D = diag (a 31 , a 32 , . . . , cr** ) 

and 

^)=diag(e(f,ef,..., e (f) 
are t x t diagonal matrices, and the matrix 



a J 



a 



2./1 



a 



2h 



is a x t matrix consisting of g^ rows of a transposed 
Vandermonde matrix. Hence, represents a parity-check 



matrix of a (shortened) Reed-Solomon code of length t and 
dimension t — which we denote by We observe 

that the matrices V and D both have full rank. Therefore, the 
product v = D ■ V ■ u defines a one-to-one mapping u v, 
such that 1 ► 0. Consequently, the statement 

3v^0 : H {1) ■ F (l) ■ v = \/l=l,...,l (19) 

is equivalent to Equation (18). With w (l > = , and 

the fact that is a parity-check matrix of the code JC^\ we 
can state another equivalent condition for a decoding failure: 

Assume that we have a vector v with Hamming weight 
wt(v) — uj. Then, the vectors I — have at 

most uj non-zero components. Now consider the matrix 



W = 



(0 



Since we know that all vectors ej i = ^ejj\ . . . , 
1, . . . , t, are non-zero, and that all non-zero error patterns are 
distributed uniformly, we also know that W contains exactly 
oj non-zero columns uniformly distributed over all non-zero 
vectors in F^. Assume that the non-zero columns in W are 
located at the indices ii, 12, ■ ■ ■ , i^, let W u be a I x u) matrix 
consisting of the non-zero columns of W, and let be 
obtained from by removing all columns whose indices 
are not in the set {ii, i 2 , ■ . ■ , iu,}- Furthermore, denote by JCffl 
the code defined by the uj x parity-check matrix . 
Then, the statements H {1) ■ W T = 0, and ■ W T U = 
are equivalent. Consequently, we can apply Lemma 6 on /c£f\ 
and to overbound the probability that a fixed vector 
v of weight uj satisfies 



v = () Vl = l,...,i 



(20) 



We observe that the probability P(v) for a vector v to fulfill 
(20) is independent of the positions of the non-zero symbols 
in v, but only depends on the weight wt (v) = uj, i.e., 

P(v : wt (v) =uj)=P UJ . 

Hence, the probability Pf(ji, ■ ■ ■ ,jt) that (18) or equivalently 
(19) is satisfied can be overbounded using a union bounding 
technique, by summing up over all non-zero vectors v: 



Pf(ji,...,jt) < 



E 



P(v) 



t 

E 



E 



Pu, ■ (21) 



veF'\{o} 



U)=l {v:wt(v)—uj} 



Since the right side of (21) is independent of the indices 
31,32, ■ ■ ■ ,jt but only depends on t, we see that (21) is also 
an upper bound on the failure probability Pf(t). 

To improve (21), we should take care about the following 
fact: if a vector v fulfills (20), a vector v' = av also 
fulfills (20) for all a € F 9 \ {0}. Therefore, we call v and 
av equivalent vectors. Since there are q — 1 different non- 
zero elements in F 9 , there exist q — 1 equivalent vectors 
for each non-zero vector over F„. Thus, the number M w of 
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non-equivalent vectors of length t with a certain weight uj is 
calculated by 

't\(q-ir = ff 



Ma, 



q-1 



(9-1) 



(22) 



Hence, to obtain a better upper bound on Pf(t), we can 
multiply the probabilities P w bounded by (16) by the number 
of non-equivalent words of weight ui calculated by (22), and 
sum up over all weights 1 < u) < t. In this way we obtain 



t 



Pf(t)<^2P u -M u < 
t 

E 



< 



q ^' 



q - 


I— 1 


q i+i 


- 1 


q l - 








q l - 




q l - 




q l - 






q l - 





LU = 
t 

q 



t\ ( {q-l)q l 
cjj \ ql-1 

Eli e <e) 



q 



q-l 

-l(N-K)+(l+l)t 



q 



q-l 

-(* + l)(tmax 



q-l 

This proves Theorem 6. ■ 
For the case of homogeneous IRS codes, the bound (17) 
was mentioned the first time in [9] without proof. A similar 
but somewhat weaker bound is presented in [6] for decoding 
homogeneous interleaved Algebraic Geometry (AG) codes. 
Applied to homogeneous IRS codes, this bound basically 
differs from (17) by the factor -^j. 

D. Comparison of Pf, and P e 

The decoding performance of an IRS scheme is mainly 
influenced by the probabilities Pf(t) and P e (t) in the range 
t g < t < t max - To get an impression of these probabilities, 
we consider a homogeneous IRS code composed of three 
codewords of the code A = US (2 s ; 255, 223, 33). This 
Reed-Solomon code is used in several practical applications, 
and guarantees to decode all error patterns with a weight up to 
16. The resulting IRS code A. can be considered as MDS code 
of length 255, dimension 223, and minimum distance 33 over 
the field F/ 2 s)3. According to Theorem 2, our collaborative 
decoding strategy is able to correct up to 24 errors. Hence, 
we calculate the upper bound on Pf(t) pursuant to (17), and 
the upper bound on P e (t) pursuant to (13) in the interval 
17 < t < 24 to figure out, in how many cases the decoder 
fails or yields an erroneous result in the worst case. The results 
are depicted by Figure 3. We observe that both probabilities 
decrease exponentially with decreasing t, and that the the error 
probability P e (t) is smaller than the failure probability Pf (t) 
by several orders of magnitude in the complete range. Hence, 
the decoder performance is usually dominated by the failure 
probability Pf, and not by the error probability P e . 

E. Overall Word Error Probability 

The probability P w = P e + Pf is the probability for Algo- 
rithm 2 to yield a wrong decoding result, i.e., the probability 
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Fig. 3. Upper bounds on the failure and error probability for an IRS code 
composed of three codewords from the code TZS (2 8 ; 255, 223, 33) . 



for obtaining a decoding error or a decoding failure. For errors 
of weight t < t g , we know from Theorem 1 that we never 
fail or get an erroneous decoding result, and consequently 
Pw{t) — 0. For a received word R with t > i g column 
errors, the probability P w (t) is simply the sum of the error 
probability P e {t) and the failure probability Pf(t). Hence, 
P w (t) is overbounded by the sum P e (t) + Pf(t), or more 
precisely by 



P w {t) < P w {t) 







t < to 



(23) 



min{P e (i) + P f (t),l} t>t g 

Now, we consider a codeword A e A with elements from 
¥ q , which is corrupted by an error matrix E. We assume that 
a non-zero column in E occurs with probability p, and that 
the non-zero column vectors are distributed uniformly over 
F' \ {0}. With this error model, the probability for t column 



errors is 



p* ■(!-!>) 



N—t 



and we can overbound the overall word error probability by 

N 



■ p w (t) ■ p i ■ (i - P ) 



N-t 



(24) 



VII. IRS Codes in Concatenated Code Designs 

Reed-Solomon codes are commonly used as outer codes 
in concatenated code designs. In many applications, we even 
find schemes with interleaved Reed-Solomon codes of some 
kind. However, usually the Reed-Solomon codewords of such 
schemes are decoded independently, i.e., the special inter- 
leaved structure is typically not taken into account by the 
decoder. 

To investigate the applicability of collaborative decoding 
strategies for IRS codes in concatenated code designs, we 
consider simple concatenated schemes with outer IRS codes 
and inner block codes. 
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A. Concatenated Code Construction 

To construct a concatenated code, we consider a codeword 
A e A. of an IRS code over the field F g , where q = p m . We 
group the elements of the codeword matrix 

A = : = (ao, . . . , ajv-i) , 
into the column vectors 



a,j 



i = o, 



,N-1 



Then, we select a linear code of length n, dimension k = Im 

distance d ov 
one mapping g(a) : a € F„ 



and minimum distance d over the field F p and define a one-to 



b € B. We apply this mapping 
to the column vectors aj, j = 0, . . . , N — 1, of the matrix A. 
In this way, we obtain a matrix 

C = ( 5 (oo), • • • , fl(ajv-i)) - (tj, • • • , , (25) 

in which all columns are codewords of the inner code B. We 
define the set {C} of all matrices obtainable in this way to be 
the concatenated code C. For practical applications, usually 
it is preferable to have codes with a binary representation. 
Therefore, we consider binary inner codes, i.e., p = 2, and 
outer IRS codes over an extension field of F2, i.e., q = 2 m . We 
assume that the concatenated code C is used for transmission 
over an Additive White Gaussian Noise (AWGN) channel with 
Binary Phase Shift Keying (BPSK) modulation. In this case, a 
codeword 



C = (b T , 



c i,j € F 2 , is mapped to the signal matrix 

X = (xq, . . . , xj f _ 1 ) = (xij^j 

by Xi.j = (— l) Ci ,j , and transmitted over the AWGN channel. 
At the output, we observe the matrix 

Y = {yl,---,ylr-i) = (yi,j) , 

with yij = Xij + riij. Since we use energy 1 to transmit 
a binary code symbol, the variables njj are statistically 
independent Gaussian random variables with zero mean and 
variance a 2 = No/2, where N n is the single sided noise 
power spectral density. Moreover, for a concatenated code C 
of rate R — |p, the effective energy for transmitting a single 
information bit is specified by the code rate R. Hence, we are 
able to characterize the AWGN channel by its signal-to-noise 
ratio, or more precisely by the ratio = 2 ^ ai . 

B. Decoding of the Concatenated Code 

To decode Y with respect to the inner code, we use a 
Maximum Likelihood (ML) decoder for the code B to find 
the ML estimates 



arc max 
beB 



rK^rKlJN 1 -- 



(P(b T \y])) (26) 

PREPRINT - 



for all columns of Y. Then, we use the inverse mapping 
g~ l {b J ) : b T e B ^ a e ¥ l q to obtain the matrix 

<V(i)> 



/ r (l)\ 

R=(g- 1 (b T ),...,g- 1 (b T N _ 1 ))= : 



(27) 



Each row £ — 1, . . . , I, of R corresponds to an input word 
of the Reed-Solomon code Hence, basically any row 
can be decoded independently from all others with respect 
to the corresponding Reed-Solomon code However, 
since an erroneous decision of the inner ML decoder may 
affect a complete column of the matrix R, it occurs to be 
more expedient to apply the collaborative decoding strategy 
described in Section III to decode all rows of the IRS code 
simultaneously. In this case, the word error probability p w 
at the output of the inner decoder will be the column burst 
error probability at the input of the collaborative decoder for 
the outer IRS code. In other words, the probability p w is the 
probability that a column of R is erroneous. 

Unfortunately, the exact analytical calculation of p w is 
generally not feasible. However, there are several known 
techniques to bound p w from both sides. One of the easiest and 
most widely known techniques to overbound the ML decoding 
error probability is the Union Bound. 

C. Error Probability after Inner ML Decoding 

Unfortunately, the Union Bound is not very tight for chan- 
nels with a bad to moderate signal-to-noise ratio, so that it is 
not suitable for analyzing our concatenated scheme. Therefore, 
we consider Poltyrev's Tangential Sphere Bound (TSB) [29]. 
Using this bound, p w can be overbounded by 

Pw<P TSB = A w j(w,a 2 ,t ) + ((a 2 , t ) , (28) 



W — l 



where 



l{w,a 2 ,t ) = -exp ( 

oo V*o 

• / / - X „ exp (-x 2 ) cosh ( 2x. , 



n-2 t - y 2 2 
2 ' 1 + y 2 X 



dydx , 



«*«*£/«p(-(. + yg)J- 

— oo ^ ' 

•T ((n-l)/2,t x 2 )dx , 



and 



T(a,z) 



T(a) 



T(a,z) 



is the normalized incomplete Gamma function. The parameter 
to is calculated from the cone half-angle 



6,., = arccos 



t (n - w) / 
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by solving the equation 

n-l 



Y,A W (l - u (w - / sm"- 3 (^)^ = 



r((rc-2)/2) 
VxT((n -l)/2) 



where 



u(a;) 




is the Heaviside step function. 

To bound p w from below for channels with bad and moder- 
ate signal-to-noise ratios, Shannon's Sphere Packing Bound 
[30] is well suited for our purposes. The Sphere Packing 
Bound is calculated by 



Pw > P — — 7T~ i 

- SPB 2¥r((n-l)/2) 



n — 2 

x exp 



erfc 



- cot(0)a; 



da; , (29) 



2 y ~" v V2 

where the half angle 9 is obtained by solving the equation 

V^((n -l)/2) 



sin"- 2 (i?)di? 



MT(n/2) 



with M = 

For good channels, i.e., high signal-to-noise ratios, the 
Sphere Packing bound is known to be rather weak. Therefore, 
we use Seguin's L 2 -Bound [31] to bound p w from below for 
good channels. The L 2 bound is calculated by 



Pw > P, 



(30) 



W—l 



At this, the denominator £1 is calculated by 

+ 2{A W - 1)V (a(w, w ),\l^2> + 



where 



a(i,j) = min 



\i_ [J i+j-d 
j ' V V 2y/Tj 



and 



i - p 2 1 1 



exp 



u 2 — 2puv + 
2(1 -P 2 ) 



dudv 



To obtain a lower bound on p w , we take the maximum of 
the Sphere Packing bound (29), and the L 2 bound (30), i.e. 



Pw > P — max \p ,p } . 

1W — —W L—SPB —L 2 j 



(31) 



Since the TSB is quite tight for all channel conditions, it 
directly serves us as upper bound on p w , i.e., 



Pw < P w - Pi 



(32) 



The bounds (31) and (32) turn out to be beneficial to derive 
bounds on the decoding performance after outer decoding. For 
decoding the outer code, we basically have two options. Either 
we apply Algorithm 2 for collaboratively decoding the outer 
IRS code, or we use a classical BMD decoder to decode the 
I outer Reed-Solomon words independently row by row. 

D. Word Error Probability with Collaborative Outer Decoding 

We first analyze the collaborative decoding strategy, which 
applies Algorithm 2 for decoding the outer code. To overbound 
the word error probability in this case, it would be helpful 
to apply the Theorems 5 and 6 derived in Section III. However, 
Theorem 6 requires that all column errors occur equiproba- 
ble in the IRS scheme. Unfortunately, this is not true after 
decoding the inner code, since due to the characteristics of 
ML decoding, low-weight error patterns occur more frequently 
than high-weight patterns. To be able to apply Theorem 6 
anyway, we slightly modify the our concatenated coding 
scheme to randomize the error patterns after inner decoding. 
For this purpose, let Mi be the set of all non-singular I x I 
matrices with elements from the field ¥ q . Now we modify the 
encoding rule given by Equation (25) into 

C = (g(M ■ ao), . . . , g(M N -! ■ ajv-i)) 

(33) 



— (@0 j • • • ,Pn-iJ j 



where the matrices Mi are statistically independent random 
matrices, uniformly distributed in Aii. The reverse mapping 
after inner decoding described by Equation (27) is modified 
to 



R 



is the bivariate Gaussian function. 
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( K M^-g-\pl),...,M- N \-g- l {(3 T N _ 1 )) ■ (34) 

This randomization procedure does not influence the un- 
corrupted columns in R, but only ensures the erroneous 
columns after inner decoding to be transformed into uniformly 
distributed error patterns. Since the number of erroneous 
columns is not changed by randomization, it should have no 
negative impact on the decoding performance. However, this 
randomization procedure allows us to apply Theorem 6 for 
estimating the failure probability after outer decoding. Later, 
we will observe by means of experimental results that from a 
practical point of view randomization is not necessary, since 
the decoding results will be virtually the same, regardless 
whether we use randomization or not. 

Theorem 7 (Upper Bound on P^) Consider a concate- 
nated code design with randomization as described above, and 
assume that the outer component codes are chosen such that 
(11) is satisfied. Furthermore, assume that the words of the 
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inner code are decoded by a Maximum-Likelihood Decoder 
and the words of the outer interleaved Reed-Solomon code 
are decoded jointly by Algorithm 2. Then, the word error 
probability after decoding is overbounded by 

P c w < E { N t )-P w {t)-p N {t,p w ,p w ) , (35) 



t=t g +i 



where 



and 



P w (t) ±mm{P f (t)+P e {t),l} 



Pn 



(1-p ) (A, - t) , i < p N 



Pl-il-pJ^, p,N<t<p w N. 

(36) 



Proof: For proving Theorem 7, we consider the fact that 
the word error probability p w after inner decoding coincides 
with the column error probability p at the input of the 
outer IRS decoder. Hence, according to (24) the word error 
probability P^ after outer decoding is overbounded by 

N / \ 

E ( J -P^-pKI-pX-*, (37) 
t=t,+i V r / 



where 



P w (t)±mm{P f (t)+P e (t),l} ■ 
To further overbound (37), we define the function 



^Pi 



(1 ~Pv 



\N-t 



and calculate the derivative 



dp* 



-J(p W )=tp t - 1 (l-p w ) 



N-t 



.pt w (N-i)(l-p w ) 



N-t-l 



(38) 

From this derivative we observe that f(p w ) is monotonically 
non-decreasing as long as p w < jj, and monotonically non- 
increasing, if p w > -jij. Consequently, to overbound f(p m ), 
we replace p w by p w if t < p w N, and by p^ if t > p N. In 
the range p N < t < p w N, we do not know, whether f(p w ) 
is increasing or decreasing, so we overbound it by ,f(p w ) < 
— P^)^"*- This results in the following upper bound: 

f(p w ) < Pn (t,P w ,P w ) = 

V ) ( - N - t \ t < p N 

i—W v — w' —W 

= < Pi-(l-pJ N - t} , P w N<t<p w N . 
^■(I-Pj^-V, t>p w N 

Hence, replacing f(p w ) = p* w ■ (1 - p w ) N ^ in (37) by 
Pn (tiPw'Py^) yields (35), and consequently proves Theo- 
rem/. ■ 



E. Word Error Probability with Independent Outer Decoding 

Now, we consider the case that we decode each row of R 
independently by a classical BMD decoder. We overbound the 
word error probability P^ for independent decoding, i.e., the 
probability that BMD decoding fails or is erroneous for at least 
one of the I rows, by the following theorem: 

Theorem 8 (Upper Bound on P^) Consider a concate- 
nated code design as described above, and assume that the 
outer component codes are chosen such that (11) is satis- 
fied. Furthermore, assume that the words of the inner code 
are decoded by a Maximum-Likelihood Decoder and that 
the words of the outer interleaved Reed-Solomon code are 
decoded independently by a BMD decoder. Then, the word 
error probability after decoding is overbounded by 



P 1 < 



N 

E 

t=t„+i 



■Pn 



(t,P w ,P w ) , 



(39) 



where p N (t,p wl p^j is calculated by (36). 

Proof: If the number of erroneous columns in R is t < 
t g , we always obtain a correct decoding result. To overbound 
P^, we assume that for t > t g the independent decoding 
strategy based on standard BMD decoders never takes a 
correct decision. In other words, we simply replace P w (t) 
in Theorem 7 by one, which directly yields the statement of 
Theorem 8. ■ 
Theorem 8 allows us to estimate the worst case decoding 
performance for independent decoding of the outer Reed- 
Solomon codes. However, to be able to assess the potential of 
our collaborative decoding strategy, we also require a lower 
bound on P^. For this purpose, we first consider the following 
lemma: 

Lemma 7 Consider an IRS code according to Definition 3, 
and assume that we have t column errors, uniformly dis- 
tributed over F^\{0}. Further, assume that the Reed-Solomon 
codewords are decoded independently by a BMD decoder. 
Then, the probability that a corrupted IRS word is not decoded 
correctly, i.e., the probability that the BMD decoder fails to 
decode at least one Reed-Solomon word, is lower bounded by 



Pi{t)>P\{t)±l 



m=iE 



8 = 



(9-1)' 



(40) 



where t 



n-kW 



Proof: Consider the set J\f of all I x t matrices, such 
that the Hamming weight of row £, £ = 1 , . . . , I is smaller or 
equal to tg'. The cardinality of this set is calculated by 

k = i-A/] = n e (!) ^- i ) < - 

1=1 8 = W 

Each matrix in Af which has no all-zero columns, represents 
a correctable error pattern, i.e., the BMD decoder is able to 
decode 

N r < N* 
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different error patterns with t erroneous columns. The total 
number of matrices without any all-zero columns is [q l — l) . 
Hence, the probability for an error pattern with t column errors 
to be decoded is 



pUt) 



< 



N* 



W - 1) " W - 1) 



t • 



Consequently, the probability that an error pattern with t 
column errors is not decoded correctly, is obtained by 



PI (t) = 1 - P c (t) > 1 - 



N* 



W 1) 



= 1 nUE^oO (g-ir ^ 



Lemma 7 enables us, to state and prove the following 
theorem: 



Theorem 9 (Lower Bound on P^) Consider a concate- 
nated code design as described above, and assume that the 
outer component codes are chosen such that (11) is satis- 
fied. Furthermore, assume that the words of the inner code 
are decoded by a Maximum-Likelihood Decoder and that 
the words of the outer interleaved Reed-Solomon code are 
decoded independently by a BMD decoder. Then, the word 
error probability after decoding is bounded from below by 



P l > 



E ( N t )-pie(t)-p N (t'P w ,p w ) > (41) 



t=t„ + l 



where 



P N (^Pw,P v 



is calculated by 



pi 



En (t'PwPj) ~ SEl^ 1 



pt 



-pj^-v, 

(N-t) 



) 



t < p N 

—W 

p N <t< p w N . 

—W w 

t>P w N 



Proof: Assume that an IRS codeword is corrupted by 
t > t g column errors. Then, according to Lemma 7 these 
errors cannot be decoded successfully by independent BMD 
decoding with a probability of at least P^ (t) . Hence, to obtain 
a lower bound on P^, we weight the probabilities 



Pw ■ (1 -Pw) 



n—t 



for having t column errors by P] e (t), and sum them up from 
t g + 1 to N. In this way, we obtain the bound 



PL > 



N 

E 

t=t +i 



EA^-pI-^-Pu,) 



N-t 



(42) 



monotonically non-increasing for p w > . This allows us to 
lower bound f(p w ) by 



fiPw) >P N (t,p w ,pj) = 

'ft-Cl-pJ^-O, 



Pi-a-pJ^ 



t<pN 

— —w 

p n < t < p,.,N . 
t>P w N 



As before in the proof of Theorem 7, we consider the function 
f(Pw) = Pi ■ (1 — PwT" 1 and its derivative (38), and find 
that f(p w ) is monotonically non-decreasing for p w < and 

PREPRINT ~ PREPRINT ~ PREPRINT 



v i—W 7 

Replacing f(p w ) in (42)by£ w (t,p w ,pjj proves Theorem9. 

■ 

VIII. Performance Evaluation 

The bounds described in the previous section allow for 
analytically estimating the probability P w of independent and 
collaborative decoding. To demonstrate the achievable gain 
by collaborative IRS decoding, to verify our analytic results, 
and to get a visual impression of the bounds described in the 
previous section, we complement our investigations by Monte 
Carlo simulations. Moreover, we compute the upper and lower 
bounds described in Section VII for two specific concatenated 
code designs to understand the asymptotic behavior of these 
bounds. 

In the following, we consider the concatenated code C\, 
which is obtained from a homogeneous IRS code composed 
of I = 2 codewords of the code A = US (2 6 ; 63, 54, 10). 
The columns of this IRS code are encoded by the well known 
Golay code B = Q (23). This yields a code of length 1449, 
dimension 648, and rate 0.45 over the binary field F 2 . 

The second code C 2 is composed of a homogeneous IRS 
code created from I = 3 words of the Reed-Solomon code 
A = IIS (2 s ; 255, 223, 33), and a binary inner (30,24,4) 
code B, obtained by doubly shorting the Reed-Muller code 
1ZM. (3, 5). In this way, we obtain a binary code of length 
7650, dimension 5352, and rate 0.7. 

A. Monte Carlo Simulations 

For bad and moderate signal-to-noise ratios, the decoding 
performance can be analyzed by Monte Carlo simulations. 
We perform simulations for independent decoding and for 
collaborative decoding for the two codes C\ and C 2 . For 
collaborative decoding, we consider two variants: collaborative 
decoding with randomized errors like described in Section VII, 
and collaborative decoding without randomization. In this way, 
we obtain reliable results for block error probabilities larger 
or equal to 10~ 6 . For smaller block error rates, reliable results 
can only be obtained by extensive computational efforts. To 
check the tightness of the upper bounds (39) and (35), we 
compare our simulation results with these bounds. The results 
of these investigations are shown in Fig. 4 for the code C\ and 
in Fig. 5 for the code C 2 . 

For the code C\ we observe a gain of collaborative decoding 
of about 0.6 dB, at a block error rate P w = 10~ 6 . The 
other way round, for an ratio of about 4 dB, collaborative 
decoding is about 100 times superior to independent decoding. 
The upper bounds (39) and (35) are approximately 0.2 dB 
worse than the actual decoding performance in the depicted 
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Fig. 4. Simulated decoding performance of the concatenated code Ci 
composed of two codewords from the outer code TZS (2 6 ; 63, 54, 10), and 
an binary inner Golay code Q (23), AWGN channel, BPSK modulation. 



Fig. 5. Simulated decoding performance of the concatenated code C2 com- 
posed of three codewords from the outer code 1ZS (2 s ; 255, 223, 33), and 
an binary linear inner (30, 24, 4) code, AWGN channel, BPSK modulation. 



range. However, the horizontal offset of the two bounds is also 
in the range of 0.5 to 0.6 dB, so that the bounds can be used 
for a rough estimation of the expected gain of collaborative 
decoding. 

For the code C 2 , we observe a collaborative decoding gain of 
about 0.3 dB at P w = 10~ 6 , or in other words, the word error 
rate is about 1000 times smaller with collaborative decoding 
compared to independent decoding. The upper bound (39) 
is about 0.4 dB worse than the actual decoding performance 
for independent decoding, while the gap between the bound 
(35) and the collaborative decoding performance is about 
0.3 dB. Hence, an estimation of the collaborative decoding 
gain obtained by comparing (39) and (35) is a bit optimistic 
in this case. 

We observe for both codes that we obtain virtually the 
same error probabilities P w for collaborative decoding with 
randomized errors, and for decoding without randomization. 
This means that from a practical point of view, randomization 
has no significant influence on the collaborative decoding 
performance. Hence, randomization is only necessary to fulfill 
the statistical requirements to derive a bound on the failure 
probability Pf. 



B. Asymptotic Considerations 

Clearly, Monte Carlo simulations are only feasible, if 
the target block error probabilities are not too small. For 
applications which require very low block error rates like 
data storage systems or optical data transmission systems, 
we need analytical tools to estimate the obtainable decoding 
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performance. The bounds derived in Section VII provide us 
with such tools, since they become tight for high signal-to- 
noise ratios. To see this, we calculate the bounds (41), (39), 
and (35) for the codes C\ and C 2 - The results for C\ are 
depicted by Fig. 6, and the results for C 2 are depicted by 
Fig. 7. 

We observe for both codes that the lower bound (41) for 
independent decoding is not very tight for channels with small 
signal-to-noise ratios, but converges to the upper bound (39) if 
the signal-to-noise ratio increases. For the code C\, the lower 
bound (41) and the upper bound (39) coincide beginning from 
an -^-ratio of about 8dB, for C 2 the two bounds coincide 
a little later at about 9dB. This means that asymptotically, 
i.e., for jf- — ► 00, we exactly know the performance of 
independent decoding. 

Since we have the upper bound (35) on the collaborative 
error probability, and the lower bound (41) on the error 
probability of independent decoding, we are able to calculate 
the guaranteed collaborative decoding gain, i.e., the horizontal 
gap between (41) and (35). In Fig. 6 we observe that for C\ 
the collaborative decoding gain first increases, if the signal-to- 
noise ratio increases, and then vanishes again asymptotically, 
i.e., for jf- — > 00. This behavior is explained by the fact 
that the asymptotic gain is specified by the guaranteed error 
correction radius, which in turn is specified by the minimum 
distance. In this sense, the asymptotic gain is rather a code 
property than a decoder property. 

However, for all target block error rates which could be 
relevant for practical applications, we obtain a collaborative 
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Fig. 6. Bounds on the decoding performance of the concatenated code Ci 
composed of two codewords from the outer code TZS (2 6 ; 63, 54, 10), and 
an binary inner Golay code Q (23), AWGN channel, BPSK modulation. 



Fig. 7. Bounds on the decoding performance of the concatenated code C2 
composed of three codewords from the outer code TZS (2 8 ; 255, 223, 33), 
and an binary linear inner (30, 24, 4) code, AWGN channel, BPSK modula- 
tion. 



decoding gain by applying Algorithm 2. In Fig. 7 we see that 
the collaborative decoding gain, which can be guaranteed by 
our bounds, grows to more than 1 dB, when the signal-to- 
noise ratio improves to 9 or 10 dB. Since the computational 
complexity of Algorithm 2 is of the same order as the compu- 
tational complexity of the Berlekamp-Massey algorithm, this 
gain can be obtained virtually for free. 

IX. Conclusions 

In this paper, we presented a decoding algorithm for IRS 
codes, which is based on varying length multi-sequence shift- 
register synthesis. This algorithm allows for efficiently de- 
coding both homogeneous and heterogeneous IRS codes. We 
explained that homogeneous IRS constructions yield MDS 
codes with an optimum minimum distance, while heteroge- 
neous IRS code designs provide a high degree of freedom for 
constructing generalized concatenated codes and other special 
applications like the decoding of low rate Reed-Solomon 
codes as explained in [15]. 

The decoding of heterogeneous IRS codes is closely related 
to joint error and erasure correction. Hence, it is also possible 
to adopt our algorithm for performing joint error and erasure 
decoding. 

The decoder described by Algorithm 2 is able to correct all 
error patterns within a sphere of radius t g , where t g is half the 
minimum distance of the IRS code. Furthermore, it is able to 
correct errors beyond half the minimum distance, as long as 
the number of errors is below the maximum error correcting 
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radius 

tmax = Y^(N-K) . 

If the decoding radius is increased beyond half the minimum 
distance, it is in principle not possible to decode all error 
patterns uniquely, since the received vector may lie in a region, 
where several correction spheres are overlapping. In this case, 
three basic strategies are conceivable: the decoder selects one 
solution from the list, if several solutions exist; the decoder 
yields all possible solutions and leaves it to the succeeding 
data processing unit to cope with this list; or the decoder 
does not take a decision at all and yields a decoding failure. 
Our algorithm applies the third strategy. Whenever it is not 
able to find a unique solution for a received word with t 
errors, it declares a decoding failure. Clearly, this strategy 
makes only sense, if the failure probability is small enough, 
and hence most of the error patterns can be corrected in the 
range t g < t < t max - Therefore, we overbounded the failure 
probability Pf(t) and showed that it is in the order of -, 
for t — t max , and decreases exponentially with decreasing t. 
Besides of the failure probability Pf, we also overbounded 
the error probability P e to be able to analyze the overall 
performance of collaborative decoding, or more precisely the 
probability P w = P e + Pf for a wrong decoding result. 

Moreover, we considered concatenated code designs with 
outer IRS codes and inner block codes, since inner decoding 
causes column burst errors, which are necessary for effective 
collaborative decoding. We applied the bound on P e {t) and 
Pf(t) to analyze the decoding performance of such concate- 
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nated schemes. This analysis has been performed by deriving 
bounds on the word error probability after independent and 
collaborative outer decoding, and by using them to investi- 
gate, which decoding gains can be achieved by collaborative 
decoding in comparison to an independent decoding strategy. 
We complemented these considerations by Monte Carlo simu- 
lations for two specific code designs. We observed that for all 
operating points which could be interesting for practical appli- 
cations, we are able to achieve a collaborative decoding gain 
without increasing the decoding complexity in comparison to 
independent decoding. The probabilities P w achievable by our 
collaborative decoding strategy are 100-1000 times smaller 
than the probabilities achievable by independent decoding. 

The concatenated code designs considered here are rather 
simple, to be able to analyze them theoretically. However, 
since concatenated codes with interleaved Reed-Solomon 
codes are used in many applications and can be found in 
several standards, the basic principles discussed in this paper 
may also be applicable for such practical systems. 
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